Cascaded Amplitude Modulations in Sound Texture Perception

نویسندگان

  • Richard McWalter
  • Torsten Dau
چکیده

Sound textures, such as crackling fire or chirping crickets, represent a broad class of sounds defined by their homogeneous temporal structure. It has been suggested that the perception of texture is mediated by time-averaged summary statistics measured from early auditory representations. In this study, we investigated the perception of sound textures that contain rhythmic structure, specifically second-order amplitude modulations that arise from the interaction of different modulation rates, previously described as "beating" in the envelope-frequency domain. We developed an auditory texture model that utilizes a cascade of modulation filterbanks that capture the structure of simple rhythmic patterns. The model was examined in a series of psychophysical listening experiments using synthetic sound textures-stimuli generated using time-averaged statistics measured from real-world textures. In a texture identification task, our results indicated that second-order amplitude modulation sensitivity enhanced recognition. Next, we examined the contribution of the second-order modulation analysis in a preference task, where the proposed auditory texture model was preferred over a range of model deviants that lacked second-order modulation rate sensitivity. Lastly, the discriminability of textures that included second-order amplitude modulations appeared to be perceived using a time-averaging process. Overall, our results demonstrate that the inclusion of second-order modulation analysis generates improvements in the perceived quality of synthetic textures compared to the first-order modulation analysis considered in previous approaches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phantoms in the brain: ambiguous representations of stimulus amplitude and timing in weakly electric fish.

In wave-type weakly electric fish, two distinct types of primary afferent fibers are specialized for separately encoding modulations in the amplitude and phase (timing) of electrosensory stimuli. Time-coding afferents phase lock to periodic stimuli and respond to changes in stimulus phase with shifts in spike timing. Amplitude-coding afferents fire sporadically to periodic stimuli. Their probab...

متن کامل

Between sound and perception: reviewing the search for a neural code.

This review investigates the roles of representation, transformation and coding as part of a hierarchical process between sound and perception. This is followed by a survey of how speech sounds and elements thereof are represented in the activity patterns along the auditory pathway. Then the evidence for a place representation of texture features of sound, comprising frequency, periodicity pitc...

متن کامل

Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis

Rainstorms, insect swarms, and galloping horses produce "sound textures"--the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modul...

متن کامل

The Evolution of Modern Texture Processing

This paper studies the evolution of image texture processing techniques over the last years Although texture is a fundamental attribute of images that has been shown to play an important role in human visual perception the quanti cation and characterization of texture is di cult Early texture processing techniques described texture deterministically or statistically in terms of repeated gray le...

متن کامل

The Role of Temporal Amplitude Modulations in the Political Arena: Hillary Clinton vs. Donald Trump

Speech is an acoustic signal with inherent amplitude modulations in the 1-9 Hz range. Recent models of speech perception propose that this rhythmic nature of speech is central to speech recognition. Moreover, rhythmic amplitude modulations have been shown to have beneficial effects on language processing and the subjective impression listeners have of the speaker. This study investigated the ro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2017